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ABSTRACT 

The high concentrations derived for several strong-lensing clusters present a major inconsis- 
tency between theoretical ACDM expectations and measurements. Triaxiality and orientation 
biases might be at the origin of this disagreement, as clusters elongated along the line-of-sight 
would have a relatively higher projected mass density, boosting the resulting lensing proper- 
ties. Analyses of statistical samples can probe further these effects and crucially reduce biases. 
In this work we perform a fully triaxial strong-lensing analysis of the 12 MACS clusters at 
z > 0.5, a complete X-ray selected sample, and fully account for the impact of the intrinsic 3D 
shapes on their strong lensing properties. We first construct strong-lensing mass models for 
each cluster based on multiple-images, and fit projected ellipsoidal Navarro-Frenk- White ha- 
los with arbitrary orientations to each mass distribution. We then invert the measured surface 
mass densities using Bayesian statistics. Although the Einstein radii of this sample are signifi- 
cantly larger than predicted by ACDM, here we find that the mass-concentration relation is in 
full agreement with results from A^-body simulations. The z > 0.5 MACS clusters suffer from 
a moderate form of orientation bias as may be expected for X-ray selected samples. Being 
mostly unrelaxed, at a relatively high redshift, with high X-ray luminosity and noticeable sub- 
structures, these clusters may lie outside the standard concentration-Einstein radius relation. 
Our results remark the importance of triaxiality and properly selected samples for understand- 
ing galaxy clusters properties, and suggest that higher-z, unrelaxed low-concentration clusters 
form a different class of prominent strong gravitational lenses. Arc redshift confirmation and 
weak lensing data in the outer region are needed to further refine our analysis. 

Keywords: galaxies: clusters: general -galaxies: clusters: individual: MACS z > 0.5 sample 
- methods: statistical - gravitational lensing 



1 INTRODUCTION 

The hierarchical cold dark matter model with a cosmological con- 
stant (ACDM) is highly successful in explaining many features 
of galaxy clust ers. The universal Navarro -Freank-White (NFW) 
density profile jNavarro et al] 1 19961 . [19971) reproduces well their 
density profile over most radii, bu t the actual mass-concentration 
relation, c(M), is sti ll debated jComerford & Nataraiai] l2007l : 
iBroadhurst et alj|2008l) . 

The concentration measures the halo central density relative 
to outer parts, and is known to correlate reversely with the virial 
mass and redshift, so that lower mass and lower redshift clusters 
woul d show higher concentrations ( iBullock et al.l200ll : lDuffv et al.l 
However, recent high-resolution simulations show differ- 
ent trends than have been expected before, with a turn-around and 
decrease in concentrations towards very high virial-mass clusters 
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jPrada et al]|201 ih . The cluster baryonic content is also expected 
to play a fundamental role in shaping the cluster overall density 
profile, although the baryonic physics is yet to be fully accounted 
for in related simulations, so that the explicit effects of cooling, 
feedback, and baryonic-DM interpl ay on the mass profile and con- 
centration are still ambiguous (e.g.. lGnedin et alj|2004l : IRozo et al.l 
I2OO8I : iDuffv et al.l 12010) . Cluster observations have also yet to 
firmly assess the properties of the c(Af ) relation, due to the small 
numbers of clusters analysed to date. Extensive lensing analyses 
done in recent years (e.g., Oguri et al. 200 9; Zitrin et al.^009b|, 
120101 ; lOkabe et alj2oTol : iRichard et all2010l : |Umetsuetalj|201^ 



and o ngoing cluster lensing surveys (e.g., CLASH, Postman et al.l 
I2OIII) should help in characterising further the observed relation. 

Similar discrepancies in lensing clusters between the ob- 
served properties and those predicted by ACDM h ave been re- 
porte d, such as the arc-abundance problem (Bartelmann et al.l 
Il998h . an d the detection of several extremely large Ein- 
stein radii l lBroadhurst & Barkanall2008l : lOguri & Blandfordll2009l; 



© 0000 RAS 



2 M. Sereno, A. Zitrin 



IZitrin etalJboiljlbh . In fact, these discrepancies are iiighly re- 
lated to the over-concentration problem, as there is a clear con- 
nection between the co ncentration and the Einstein radius (e.g., 
ISadeh & ReDhaelill2008h . or the lensing cross-section and the effi- 
ciency for generating giant arcs jHennawiet alj2007h . In addition, 
the existence of high-redshift massive clusters adds to this tension, 
and may imply a non-Gaussian distribution of massive perturba - 
tions dChongchitnan & Silkll2oTll : lD'Aloisio & Nataraiadl2011bh . 
or an earlier growth of structure than entailed by ACDM, so that 
clusters, sitting atop the hierarchical build-up, would have had more 
time to bind together and concentrate, increasing their inner mass 
and lensing properties. 

The disagreement between theory and observ ation might be 
explained by some orien tation and shape biases (Og uri et alj2005l : 
ISereno & Umetsull201 lb . In clusters that are elongated along the 
line-of-sight the projected matter density is relatively higher, an d 
the observed lensing properties are boosted jHennawi et al.ll2007h . 
Correspondingly, neglecting halo triaxiality can lead to over- and 
under-estimates of up to 50 per cent and a factor of 2 in halo 
mass and concentration, respectively dCorless et al.] l2009h . As- 
suming spherical sy mmetry also causes underestimation of sta - 
tistical uncer t ainties jCorless et alj|200^ : ISereno & Umetsull20TTh . 
ISereno et iD ( l2010al) investigated a sample of 10 strong lensing 
(SL) clusters considering which intrinsic shape and orientation the 
lensing halos should have to account for both theoretical predic- 
tions and observations. They found that nearly one half of the clus- 
ters seemed to be composed of outliers of the mass-concentration 
relation, whereas the second half supported expectations of A'^- 
body simulations which prefer mild ly triaxial lensing clusters with 
a strong orientation bias. Recently, [Sereno & Umetsul JioTlh per- 
formed a full triaxial weak and strong lensing analysis of A1689, a 
cluster usually seen as strongly over-concentrated. They found the 
halo to be slightly over-concentrated but still consistent with the- 
oretical predictions. They also found some evidence for a mildly 
triaxial lens with the major axis orientated along the line of sight. 

Here, we try to assess the impact of halo triaxiality and orien- 
tation on the mass-conce ntration relat i on of a properly defined sta- 
tistical sample. Recently, Izitrin et al] ( 1201 lah presented a detailed 
strong lensing analysis of a complete sample o f X-ray selected clus - 
ters, the MACS high-redshift cluster sample jEbeling et aLll2007h . 
This sample comprises 12 very luminous X-ray galaxy clusters 
at z > 0.5. IZitrin et al] ( 1201 lij) reported a difference of ~ 40 
per cent between the observed and the theoretical distributions 
of Einstein rad ii in standard ACDM ba sed on analytic models of 
galaxy clusters. [Meneghetti et alj ( r201lh extracted clusters from the 
MareNostrum Universe to build up mock catalogs of clusters se- 
lected through their X-ray flux according to the criteria applied to 
the z > 0.5 MACS sample. Their simulated distribution of the 
Einstein ring sizes is also lower by ~ 25 per cent than that found 
in observations. They also found that orientation biases together 
with triaxiality issues still affect the lenses with the largest cross 
sections, and that the concentrations of the individual MACS clus- 
ters inferred from the lensing analysis might be up to a factor of 
~ 2 larger than expected. They suggested that for a significant 
fraction of ~ 20 per cent of the clusters in the MACS sample, the 
lensing-derived concentrations should be higher than expected by 
more than > 40 per cent. 

It should also be noted that this high-z MACS sample appears 
to consist of mostly unrelaxed clusters. The high X-ray luminos- 
ity, along with the relatively high redshift and recent Sunyaev- 
Zeldovich (SZ) effect images (unpublished; private communica- 
tion), are in support of this claim, in addition to noticeable sub- 



struct ure seen in the opti cal and the aforementioned lensing anal- 
ysis dZitrin et al] 1201 lij) . The inner mass distribution of (most 
of) these clusters seem correspondingly rather widely-distributed, 
which may account for their extensive lensing properties and large 
Einstein radii. In fact, it is known that for e.g., relaxed, lower- 
z clusters, the Einstein radius correlates with the concentration 
dSadeh & RephaelilbOOSh . More concentrated clusters have more 
mass in the middle, thus boosting their Einstein radius. For higher- 
z, unrelaxed clusters, large Einstein radii may form due to a widely- 
distributed inner mass distribution, so that the critical curves of the 
smaller substructures near the core, merge together to form a much 
larger Einstein radius cu rve (e.g.. iTorri eFS] |2004| : | Fedeli et al.l 
I2OO6I : IZitrin et al.l l2009ij) . If this is indeed the case, for such a 
sample low concentratio ns and relatively shallow inner profiles 
should be expected (e.g.. iNeto et al.ll2007ll . suggesting that unre- 
laxed (higher-z) clusters form a distinct class of impressive strong- 
lenses. 

In order to probe the 3D intrinsic shapes of the clus- 
ters, we apply here the 3D strong-lensing analysis method of 
ISereno & Umetsul ( l201lh apart from a main difference. In fitting 
the projected surface density, we adopt a n on-parametric technique , 
as implemented in the PixeLens software dSaha & Winiamsl2004l) . 
instead of a parametric modelling (though initially the multiple- 
images were found using a parametric method, see below). As in 
ISereno & Umetsul ( 12011 ). we use Bayesian statistics to invert the 
measured projected surface mass-densities. An additional impor- 
tant difference is that here we use only strong-lensing data, without 
implementing full range weak-lensing data, as no such analyses are 
currently available for this sample. In that context, upcoming weak- 
lensin g analyses for these c lusters (e.g., as part of the CLASH pro- 
gram, IPostmanetalJI^oTll) will be important to further establish 
the r esults of our work. 

IZitrin et ai]( l2011ah , thanks to their minimalistic parametric 
approach to lensing and the low number of free parameters in 
their modelling, were able to find many multiple images across 
the z > 0.5 MACS cluster fields, which we use here as input. It 
should be noted, that although the mass distributions of these clus- 
ters (and resulting critical curves and Einstein radii) were credibly 
constrained therein, due to lack of redshift information for most of 
the lensed features, only broad constraints were put on the (pro- 
jected) mass profiles of the different clusters, following t he mod- 
els be st reproducing the different multiple im ages (see Zitr in et al] 
l2011ah . Here we use the lZitrin et alj ( I2OI lal) best-model predicted 
redshifts for the different arcs, in order to maintain consistency 
with their input mass distributions. These redshift estimates were 
found to be usually very accurate, in following spectroscopic (e.g.. 
Smith et al..2009.) and photometric (the CLASH program, in prepa- 
ration) measurements of some of these clusters. 

Since our strong lensing analysis with PixeLe ns exploited 
the im age positions and source redshifts inferred in IZitrin et al] 
( I2OI la ), our modelling is not alternative. The main advantage of 
re-analysing the strong lensing features with an independent non- 
parametric technique is that PixeLens allows us to fully explore 
degeneracies in the projected mass density. We can then reproduce 
the best-fitting models of Zitrin et al. (201 la) and obtain a reliable 
estimate of the covariance matrices, that could be then used for op- 
timising the deprojection procedure. 

The paper is organised as follows: in §|2]we detail the basics of 
a triaxial NEW description, whereas in §|3]we discuss the theoret- 
ical predictions for the results. In §|4]the 12 high-2; MACS sample 
is described, and in §|5]the strong-lensing analysis procedure is laid 
out, followed by the deprojection procedure in §[6] The results are 



© 0000 RAS, MNRAS 000, 000-000 



Concentrations and orientations of the z > 0.5 MACS sample 3 



detailed in §[7] In §[8]we review some systematics. Results are con- 
cluded in §[9] Throughout the paper, we assume a flat ACDM cos- 
mology with density parameters SIm — 0.3, f^A = 0.7 and Hubble 
const ant Ho = lOO/i kms"^Mpc"\ h = 0.7 jKomatsu etaEI 
l20Tlh . 



2 BASICS ON TRIAXIAL NFW HALOS 

High resolution A^-body simulations have shown that the density 
distributions of dark matter halos are successfully described as tri- 
axial ellipsoidal NFW density profiles with ali gned, concentric axes 
dNavarro et al.ll996Lll997l : ljing & Sutol2002h . The 3D distribution 
follows 



pNFW ~ 



(C/rs)(l + CAs)2' 



(1) 



where ( is the ellipsoidal radius. The shape is determined by the 
axial ratios. The minor (intermediate) to major axial ratio is denoted 
as qi (92) with < qi 5S 52 ^ 1; we also use the inverse ratios, 
< ei = 1/qi ^ 1. 

Three Euler's angles, 6, tp and tp determine the orientation of 
the halo. 9 and tp fix the orientation of the line of sight in the intrin- 
sic system. In particular, 6 quantifies the inclination of the major 
axis with respect to the line of sight. The third angle tp determines 
the orientation of the cluster in the plane of the sky. 

The radius r2oo for an NFW spheroid can be defined such that 
the mean density contained within an ellipsoid of semi-major axis 
r2oo is A = 200 times the critical density at the halo redshift, Pc i 
dCorless et al]|2009l : ISereno et al.ll2010j : ISereno & Umetsj|201 ih : 
the corresponding concentration is C200 = ?'20o/''s. A/200 is 
the mass within the ellipsoid of semi-major axis r2oo, M200 = 
(8007r/3)<jig2r2ooPcr. 

The ellipsoidal 3D NFW halo projects on an elliptical 2D 
profile. The projected surface density profile has the same func- 
tional form of a spherically symmetric profile (Stark 1977; Serene 
I2OO7I : ISereno et alj|2010tf ). The convergence k, i.e., the surface 
mass density in units of the critical surface mass density for lens- 
ing, Scr ~ (c^ Db) / {^TvG Dd -Dds), where Db, Dd and Dds are 
the source, the lens and the lens-source angular diameter distances 
respectively, for an ellipsoidal NFW halo is 



larccosh 



X is the dimensionless elliptical radius, 



2x,l/2 



(2) 



(3) 



where e is the ellipticity and xi and X2 are the abscissa and the 
ordinate in the plane of the sky oriented along the the ellipse axes, 
respectively. 

The ellipticity and the orientation of the projected ellipses de- 
pend only on the intrinsic geometry and orientation of the system. 
The axial ratio of the major to the minor axis of the observed pro- 
jected isophotes, Ep — (l~e)~^. can be written as ( iBinggeli.1980 ). 



J + 1+ VU - 1? + 4fc^ 



where j, k and I are defined as 



(4) 



22.2, 
J = 6162 sm ( 



2 2 2 2 2 2 

61 cos 6 COS </p + 62 cos 9 sin ip, (5) 



(fii ~ 62 ) sin <p cos (p cos ( 



I = €1 sin^ if + €2 cos^ ip. 



(6) 
(7) 



The angle tp is strictly connected to the orientation angle in 
the plane of the sky. When the coordinate axes are aligned with 
the ellipse axes, is the angle between the axes of the observed 
ellipses and the projection onto the sky of the ellipsoid third axis 
( iBinnevll 19851 : ISerendl2007h , 



jp — — arctan 



2k 



(8) 



The strength of the lens Ks and of the projected length scale 
rsp are directly related to the intrinsic para meters M200 and C20 
and can be inferred by fitting lensing maps jSereno et alj|2010bh . 
The observed scale length Vsp is the projection on the plane o f the 
sky of the cluster intrinsic length Vs l lStarkll977l : rSerendl2007h . 



where / is a function of the cluster shape and orientation, 

2 2 2 2 2 2 2 

/ = 61 sin 9 sin p? + e2 sin 9 cos ip + cos 9; 



(9) 



(10) 



the parameter ca qua ntifies the elon gation of the triaxial ellipsoid 
along the line of sight ( ISerendl2007h . 



\ 1/2 

,6162 / 



(11) 



6a is the ratio between the major axis of the projected ellipse in the 
plane of the sky and the size of the ellipsoid along the line of sight. 
It quantifies the elongation of the triaxial ellipsoid along the line of 
sight. If eA < 1, then the cluster is more elongated along the line 
of sight than wide in the plane of the sky, i.e., the smaller eA, the 

larger the elongation along the line of sight. 

The lensing strength can be expressed as dSereno et alj2010bh . 



_ 1 Ps^sP 

^ Z ' 

2jcr eA 

where as usual ps = Scpcr, 
200 C200 



3 ln(l + C200) - C2oo/(1 + C200) ' 

The mass A/2oo can be expressed as 

47r -J 1 

M200 = ^ X 200pcr X (c2oor-sp) . 

3 epeA 



3 THEORETICAL PREDICTIONS 



(12) 



(13) 



(14) 



A^-body simul a tions jNeto et al.[ l2007l: iMaccio et alj l2008l : 
iGao et al.l I2OO8I : iDuffv et al.l I2OO8I : |Pradaetal.ll201lh have pro- 
vided a quite detailed picture of the expected properties of dark 
matter halos. Results may depend on parameters such as the overall 
normalization of the power spectrum, the mass resolution and the 
simulation volume (Prada et ak 201 IJ. The dependence of halo 
concentration on mass and redshift can be adequately described by 
a power law, 

c = ^(M/Afpivot)''(l + 2)'^. (15) 

As reference, we follow iDuffv et al.l ( |2008|) . who used the cos- 
mological parameters from WMAP5 and found {A,B,C} — 
{5.71 ± 0.12, -0.084 ± 0.006, -0.47 ± 0.04} for a pivotal mass 
Afpivot = 2 X lO^^Af0//i in the redshift range — 2 for their 
full sample of clusters. The scatter in the concentration about the 
median c(A/) relation is lognormal. 
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Table 1. Properties of the z > 0.5 MACS sample. Data in columns 1-8 are based on lEbeling et all )2007l) . Column 1: cluster name in the MACS survey; 
Columns 2 & 3: The RA and Declination of the X-ray centroids (as determined from Chandra ACIS-I data); Column 4: redshifts; Column 5: velocity 
dispersions, in km s~^; Column 6: Chandra X-ray luminosities in the 0.1-2.4 keV band. These luminosities are quoted within r2oo and exclude X-ray 
point sources; Column 7: X-ray temperatures, m easured from Chandra data within riooo, but excluding a central region of 70 kpc radius around the listed 
X-ray centroid; Column 8: lEbeling et alj ^200'^ morphology code - assessed visually based on the appearance of the X-ray contours and the goodness of 
the optical/X-ray alignment. The assigned codes (from apparently relaxed to extremely disturbed) are 1 (pronounced cool core, perfect alignment of X-ray 
peak and single cD galaxy), 2 (good optical/X-ray alignment, concentric contours), 3 (nonconcentric contours, obvious small-scale substructure), and 4 (poor 
optical/X-ray alignment, multi ple peaks, no cD ga laxy), errors are estimated as less than 1. Note that indeed most clusters do not seem relaxed; Column 9: 
Einstein radius estimation from lzitrin et al.l i201 13) . for a source at 2 ~ 2 ± 0.5. Note, the clusters MA CS J0018.5-I-1626 aii d MACS J0454. 1-0300 are better 
known from earlier work as C10016-H1609 and MS 0451.6-0305, respectively. For more information see lEbehng et aljj2007h . 



MACS 


a 


5 


z 


(J 


-^x, Chandra 


KT 


M.C.E. 


9e 




[J2000.0] 


[J2000.0] 




[kms-i] 


[lO^^ergs-l] 


[keV] 







J0018.5-H1626 


00 18 33.835 


-H16 26 16.64 


0.545 


1420+"" 


19.6 ±0.3 


9.4 ± 1.3 


3 


24 ± 2 


J0025.4-1222 


00 25 29.381 


-12 22 37.06 


0.584 


740+90 


8.8 ±0.2 


7.1 ±0.7 


3 


30 ±2 


J0257. 1-2325 


02 57 09.151 


-23 26 05.83 


0.505 




13.7 ±0.3 


10.5 ± 1.0 


2 


39 ±2 


J0454. 1-0300 


04 54 11.125 


-03 00 53.77 


0.538 


1250li^3° 


16.8 ± 0.6 


7.5 ± 1.0 


2 


13+1 


J0647.7-I-7015 


06 47 50.469 


-H70 14 54.95 


0.591 


qnn+120 


15.9 ±0.4 


11.5 ± 1.0 


2 


28 ±2 


J0717.5-H3745 


07 17 30.927 


-H37 45 29.74 


0.546 


1660ti^o 


24.6 ± 0.3 


11.6 ±0.5 


4 


55 ±3 


J0744.8-H3927 


07 44 52.470 


-1-39 27 27.34 


0.698 


1110+130 


22.9 ± 0.6 


8.1 ±0.6 


2 


31 ± 2 


J0911.2-H1746 


09 11 11.277 


+17 46 31.94 


0.505 


1150+2^° 
iJ^3'J_340 


7.8 ±0.3 


8.8 ±0.7 


4 


111? 


J1149.5-H2223 


11 49 35.093 


+22 24 10.94 


0.544 


1840^170 


17.6 ± 0.4 


9.1 ±0.7 


4 


27 ±3 


J1423.8-H2404 


14 23 47.663 


+24 04 40.14 


0.543 


1300+17° 


16.5 ± 0.7 


7.0 ±0.8 


1 


20 ±2 


J2129.4-0741 


21 29 26.214 


-07 41 26.22 


0.589 


14001^^0 


15.7 ±0.4 


8.1 ±0.7 


3 


37 ±2 


J2214.9-1359 


22 14 57.415 


-14 00 10.78 


0.503 


1300l?°o 


14.1 ± 0.3 


8.8 ±0.7 


2 


23 ±2 



p(lnc|M) = 



exp 



Inc - Inc(Af) 



(16) 



with a dispersion cr(lo Kiri C200) = 0.15 for a full sa mple of clus- 
ters touffv et al.ll2008l) . Recently. [Prada et alj JloTTI) claimed that 
the dependence of concentration on halo mass and its evolution can 
be obtained from the root-mean-square fluctuation amplitude of the 
linear density field. They noticed a flattening and upturn of the rela- 
tion with increasing mass and estimated concentrations for galaxy 
clusters substantially larger than results reported in Eq. (15}. 

The distribution of min or to major axis ratios can be approxi- 
mated as jjing & Sutdl2002l) . 



p(gi) OC exp 



where — 0.54, CTs = 0.113 and 



(17) 



(18) 



with M* the characteristic nonlinear mass at redshift z and Afvir 
the virial mass. The conditional probability for q-z goes as 



p{qi/q2\qi) = 



2(1 -r„,in) 



2gi /172 — 1 - J-min 



1-rn 



(19) 



for qi/q2 1? ^'min = max[gi,0.5], whereas is null otherwise. The 
axial ratios distribution of the lensing populat ion mimics that of the 
total cluster population jHennawi et al.|[2007h . 

For the orientation, we considered a population of randomly 
oriented clusters with 



p(cos6) = 1 

for ^ cos S ^ 1 and 

p(v?) = - 



(20) 



(21) 



for — 7r/2 ^ Lp ^ 7r/2. The distribution for -i/; is the same as 
that for Lp. For compar ison, we also considered a biased popu- 
lation. Semi-an alytical ( O guri & Blandfordlioog l) and numerical 
dHennawi et~^ [2007) investigations showed a large tendency for 
lensing clusters to be align ed with the li ne of sight. This condition 
can be expressed as ( Corless et alj2009l) 



p(cos OC exp 



(cos0 - 1)^ 



2^1 



(22) 



A value of ae = 0.115 can be representative of the orientation bias 
for massive strong lensing clusters. 



4 THE MACS HIGH-REDSHIFT CLUSTER SAMPLE 

The Massive Cluster Survey (MACS) supplies a co mplete sample 
of the very X-ray luminous clusters in the Universe jEbeling et al.l 
I2OOII l2010i) . From this catalog, a complete sample of 12 high-z 
MACS clusters (z > 0.5, see Table[T]l was defined bv lEbeling et al.l 
( l2007l) . which have proved very interesting in sever al follow-up 
studies including deep X-ray, SZ and HST imaging (see lZitrin et al.l 
2011a, and references therein ). In particular, here we use the 
multiple-images identified in Z itrin et al.l ( l2011ah as input for our 
non-parametric mass reconstruction. 

Although the MACS clusters are not selected by their lens- 
ing features and therefore are not biased in that sense, they are ex- 
pected to comprise some of the most massive clusters in the Uni- 
verse, and thus can be expected to produce i mpressive lenses. As 
also discussed above in § [T] by a SL analysis Izitrin et al] 1 I2OI lah 
deduced the distribution of Einstein radii for this sample, which 
are found to be noticeably larger than expected by ACDM (see also 
[Meneghetti et al. 201 1), even after taking lensing bias into account. 

Here, we deduce the triaxial 3D mass distribution of these 
clusters, in order to examine whether the c{M) relation of this sam- 
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Figure 1. Pixelated map of the mean convergence of MACSJ0018 as de- 
rived with PixeLens. We considered for the analysis only the pixels con- 
tained between the inner and the outer thick line. The thin line plot the 
NFW profile corresponding to the best fit (contours runs from 2.2 to 1.4 in 
steps of 0.2). North is up. 



Table 2. Pai'ameters set in the PixeLens analysis. Columns 2 and 3 report 
the radius of the circular region investigated in pixels and arcseconds, re- 
spectively. Column 4 lists the pixel resolution. 



MACS radius resolution 

[pixels] ["] [kpcft-l] 



J0018 


14 


30.0 


9.6 


J0025 


14 


95.0 


31.4 


J0257 


14 


38.0 


11.7 


J0451 


14 


45.0 


14.3 


J0647 


14 


80.0 


26.6 


J0717 


15 


135.9 


40.5 


J0744 


14 


80.0 


28.6 


J0911 


14 


20.0 


6.1 


J1149 


15 


65.0 


19.3 


J1423 


15 


38.0 


11.3 


J2129 


14 


40.0 


13.3 


J2214 


14 


45.0 


13.8 



Table 3. Observed projected NFW parameters of the z > 0.5 MACS sam- 
ple. Columns 2, 3, 4, and 5 list the lensing strength /ts, the projected radius 
Tgp, the ellipticity e and the orientation angle 9^. Central values and dis- 

i iersions are comp uted as bi-weight estimators of the marginalised PDFs 
Beers et all 1990) . 6»e is measured North over East. 



MACS 


Kg 




e 


9e 






[kpc] 




[deg] 


J0018 


0.29 ± 0.06 


2200 ± 1300 


0.32 ±0.12 


-50± 15 


J0025 


0.26 ± 0.04 


8100 ±4500 


0.21 ±0.09 


-25 ± 24 


J0257 


0.37 ± 0.06 


1200 ±500 


0.12 ±0.08 


2±45 


J0451 


0.28 ± 0.05 


2700 ± 1500 


0.20 ±0.10 


6±46 


J0647 


0.28 ±0.01 


3500 ±400 


0.07 ± 0.02 


11 ±36 


J0717 


0.26 ± 0.07 


5700 ± 4000 


0.07 ± 0.04 


-58 ±29 


J0744 


0.74 ± 0.03 


2000 ± 300 


0.59 ± 0.02 


-0.6 ± 1 


J0911 


0.39 ±0.19 


850 ±780 


0.29 ±0.10 


-17 ±34 


J1149 


0.23 ± 0.02 


3900 ± 1100 


0.10 ±0.03 


-65 ± 17 


J1423 


0.29 ± 0.07 


1700 ± 1000 


0.34 ± 0.09 


-25 ±8 


J2129 


0.31 ±0.06 


3700 ± 2600 


0.07 ± 0.04 


-2 ±40 


J2214 


0.29 ± 0.07 


2000 ± 1200 


0.32 ±0.10 


54 ±20 
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Figure 2. Contour plot of the marginalised PDF for the strength Ks and the 
projected radius r^p for MACSJ0018. Contours are plotted at fraction val- 
ues exp(— 2.3/2), exp(— 6.17/2), and exp(— 11.8/2) of the maximum, 
which denote confidence limit region of 1 , 2 and 3(t in a maximum likeli- 
hood framework, respectively. 



pie suffers the same discrepancy as found for the Einstein radii. 
In this context, this high-z MACS sample is of unique interest. 
Being at a relatively high redshift, and mostly consisting of un- 
relaxed clusters (see also Table [T}, this sample cannot be expected 
to follow the usual Eins tein radius - concentration relation (e.g., 
ISadeh&Rephaelill2008h . Instead, it can shed more light on the 
relaxation epoch of massive clusters, and show that, if concen- 
trations are low, unrelaxed (and mostly higher- 2:) clusters indeed 
form a different class of prominent gravitational lenses (than well- 
relaxed and concentrated lower-z cl i jsters), as previously implied 
jTorri et al.ll2004 IPedeli et alibood ; iNeto et alJlioOTi : Izitrin et all 
l2011bh . 



5 STRONG LENSING ANALYSIS 

We performed strong lensing analyses of the inner regions of the 
high redshift MACS sample. A detailed desc ription of the stron g 
lensing features of the sample can be found in lZitrin et al.l ( l2011ah . 
We adopted a two step procedure. Firstly, we obtained the surface 
mass density for each cluster. Secondly, we modelled the map with 
a projected NFW profile. 

We obtained a pixelated map of the surfac e mass density of 
each cluster by use of the PixeLens software jSaha & Williamsl 
|2004 . PixeLens generates a statistical ensemble of models which 
can exactly reproduce the positions and parities of all multiple- 
image. The ensemble automatically explores degeneracies in con- 
vergence and provides uncertainties. For each cluster, we computed 
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200 convergence maps. Parameters set to run the code are listed 
in Table |2] For each lens, we analysed a circular area centred on 
the brightest cluster galaxy (BCG). The external radii are listed in 
Table |2j they extend just behind the outermost image. These cri- 
teria select a region larger than that enclosed by the Einstein ra- 
dius, see Tables [T] and [2] The effectiveness of PixeLens is conse- 
quently poorer in the outer pixels. We then excised these pixels as 
explained in the following. In determining the optimal pixel res- 
olution we had to counterbalance two competing effects. On one 
side, a more refined grid of pixels allows for a better constraining 
of small scale structures. On the other side, a too large number of 
pixels slows the code to a not sustainable extent. It can also induce 
over-sampling effects. From the analysis of lensing in simulated 
systems, we found that a radius of ~ 14 — 15 pixels gives the best 
compromise. The explicit resolutions are listed in Table[2l 

PixeLens may offer a poor description of the mass profile 
in the very outer pixels, with an unphysical sharp drop in den- 
sity just outside the Einstein radius. We preliminary computed the 
slope profile of the surface density in spherical annuli centred in 
the denser pixel and checked for logarithmic density slope values 
smaller than -2, the minimum allowed slope for a projected NEW 
profile. We required at least two consecutive annuli with slope 
lower than the threshold. We fixed the minimum convergence at the 
averaged convergence of the first spherical region with unphysical 
slope and excluded from the analysis pixels with convergence lower 
than the minimum. With the aid of simulated images, we checked 
that this cutting procedure fares very well. In the radial span of just 
a couple of pixels near the Einstein radius, the logarithmic slope 
suddenly falls from > — 1 in the internal region (where ^ ^ rsp) 
to values < — 3 or even steeper. We retrieved this feature, which 
is artefact of PixeLens, in both simulated or observed clusters. We 
checked that the value of the minimum allowed convergence was 
negligibly affected by the use of different cutting criteria, such as 
either a different minimum value for the slope (> —1.5) or by re- 
quiring that the allowed slope variation between adjacent annuli 
was smaller than a maximum difference. 

We also excluded the central pixels. IUmetsuetal1 ( l201lj|bh 
found a small offset of typically < doff = 20 kpch~^ between 
the BCG and the dark matter centre of mass recovered from strong 
lens modeling. Even if in PixeLens the centre position is left free 
and only a preliminary geometrical centre (usually coinciding with 
the position of the BCG) is used, we conservatively limited our 
analysis to radii greater than 2 doff from the pixel with the maxi- 
mum convergence, beyond which the cluster miscentering effects 
are negligible. For z > 0.5, this criterium usually cuts only the 
very central denser pixel. The exclusion of the central pixels is also 
conservative with regard to the effects of baryons on the total mat- 
ter profile. In fact, in the investigated range the total distribution is 
dominated by dark matter, whose behavior is very well modelled in 
numerical simulations. An example of pixels which passed the cuts 
is plotted in Fig.[T] 

As a second step, we modelled the pixelated maps with a pro- 
jected NFW profile. We looked for the minimum of 

X = ^2 ~ KNFw(a;l,a;2;6'o.i,^'o,2,Ks,rsp,e, (23) 

i 

where the sum runs over the pixels and ki is the measured conver- 
gence of the i-th pixel of coordinates x\,x\; knfw is the theoret- 
ical prediction. Each pixel was given the same weight. We fitted 
6 free parameters: the halo centre coordinates, So.i and ^0,2; the 
strength Ks(> 0) and the projected radius rsp(> 0); the ellipticity 



Table 4. NFW intrinsic parameters of the 2 > 0.5 MACS sample as ob- 
tained from the Bayesian deprojection. Columns 2, 3 and 4 list the val- 
ues of mass, concentration and (cosine of) inclination angle, respectively. 
Central values and dispersions are computed as bi-weight estimators of the 
marginalised PDFs. 



MACS 


Ajf200 

[10i5AfQ/h] 


C200 


cos Q 


J0018 


4.1 ±2.7 


1.9 ±0.9 


0.59 ±0.29 


J0025 


4.3 ±3.2 


1.6±1.1 


0.73 ± 0.27 


J0257 


2.7 ± 1.7 


2.7 ± 1.1 


0.76 ± 0.26 


J0451 


4.8 ±2.5 


1.6 ±0.7 


0.74 ±0.25 


J0647 


6.8 ±1.4 


1.0 ±0.2 


0.81 ±0.12 


J0717 


4.5 ±3.0 


1.5 ±1.2 


0.57 ±0.20 


J0744 


8.0 ±1.3 


3.1 ±0.5 


0.38 ±0.23 


J0911 


1.8± 1.7 


3.6 ±2.2 


0.57 ±0.30 


J1149 


5.1 ±1.9 


0.9 ±0.3 


0.85 ±0.11 


J1423 


2.9 ±2.2 


2.0 ±0.9 


0.59 ±0.29 


J2129 


3.5 ±3.1 


1.7± 1.1 


0.80 ±0.12 


J2214 


3.2 ±2.8 


2.3 ± 1.2 


0.61 ±0.29 



(0 <)e(^ 1) and the orientation angle Q^. In Fig. [T] we plot the 
result of such fitting procedure for a mean convergence map. 

We carried out the fitting procedure for each map. From the 
derived ensemble of maximum likelihood parameters, we obtained 
the posteriori distribution of projected NFW parameters. For each 
map, we considered only the best-fit parameters. Since we fitted 
each map separately, correlation effects and convergence stability 
for each pixel were not considered at the level of Eq. J23b . These 
effects were taken into account later by grouping the sets of maxi- 
mum likelihood parameters. An example of fitting results is plotted 
in Fig.[T] 

Our approach is alternative to standard ones which constrain 
NFW elliptical profiles by directly fitting multiple image systems 
with parametric models. Due to the presence of local substructures, 
such a method might underestimate parameter errors. On the other 
hand, we fitted the 2D mass distribution rather than the image po- 
sitions. Lensing is highly non-linear. Attempting to directly fit the 
multiple images positions with an NFW model can then give differ- 
ent results from those obtained by reproducing the image positions 
with a pixelated map and then fitting the map with the same para- 
metric model. This is more similar to NFW fitting in numerical 
simulations, where the 3D mass density is directly fitted. 

Fitting results for the relevant parameters are summarised in 
Table [3] The very large values for r^p and the related very large 
errors reflect the inability of the fitting procedure to effectively 
constrain the scale length. Strong lensing focuses on the very in- 
ner regions whereas a precise determination of rsp would require 
a coverage up to larger radii, as in weak lensing analyses. Degen- 
eracies effects make even very large values of rsp compatible with 
multiple image positions. As a consequence, the central momentum 
of the distribution is pushed to large values too. This can be seen in 
the marginalized PDF distribution for and rsp, see Fig.|2] Most 
likely values for r^p are at small values but the tail at large values 
is really extended. The large uncertainty on rsp will carry over into 
a large uncertainty on halo mass and concentration. 

6 DEPROJECTION 

Deprojecting a surface density map to infer the intrinsic 3D shape 
is an under-constrained astronomical problem. Apart from 3 coor- 
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dinates fixing the halo centre, the 3D NFW halo features seven pa- 
rameters: the density profile is described by two parameters (AI200 
and C2Q0), the shape by two axial ratios (qi and 52), the orientation 
by three angles, 6 and (p and -0. On the other hand, since lensing 
is dependent on the projected mass density, from lensing observa- 
tions we only get four constraints relating the intrinsic parameters: 
the lensing strength Ks, the projected radius Vsp, the ellipticity e 
and the orientation 6c. Even combining multi- wavelength observa- 
tions, from X-ray through the optical and to radio bands, one can 
only constrain the elongation of the cluster along the line of sight 
( ISerendl2007h . 

To assess realistic probability distributions for the intrin- 
sic parameters we performed a statistical Bayesian analysis. The 
use of some a priori hypotheses on the cluster shape can help 
to disentangle degeneracies. We applied som e methods already 
employed in g ravitation al lensing an alyses jOguri et all I2OO5I : 
Corlessetal 120 09: Sere no et al .l2010ah . n particular, we followed 
Sereno & Umetsui ( i2011i) . where further details on the method can 
be found. The Bayes theorem states that 

p(P!d) cx L{P\d)p(I>), (24) 

where p(P|d) is the posterior probability of the parameters P 
given the data d, £(P\d) is the likelihood of the data given the 
model parameters, and p(P) is the prior probability distribution for 
the model parameters. 

Using the functional dependence of the projected parameters 
on the intrinsic ones, the likelihood for the intrinsic parameters is 
the likelihood C{ks, r^p, e, 6^) derived in Sec.|5] This distribution 
was smoothe d using a Gaussian kernel estimator jVio et al.|[T994l : 
lRvderj|l996h . 

As prior for the axial ratios gi and q2, we considered the A'^- 
body predictions in Eqs. ( |17lll9t . We always put a lower bound 
q\ ^ 0.1. For the alignment angle 9 and the the azimuth an- 
gle ip, we considered a random distribution. For the mass, we al- 
ways used a flat prior p(M2oo) = const, in the range 10^'^ ^ 
M2oo/(M0/i"^) ^ lO^'', whereas the prior for the concentra- 
tion was flat in the range < C200 ^ 30 and null otherwise. 
Posterior PDFs were computed by running four Markov chains. 
We checked for chain convergence by verifying that the standard 
var(chain mean)/mean(chain var) indicator was less than 1.1. We 
computed at least 10000 samples per chain and eventually added 
ten thousands more until the convergence criterium was satisfied. 

Results are sunmiarised in Table |4l Central location and dis- 
persion for each parameter are computed as biweight estimators 
of the corresponding ma rginalised posterior probability function 
(PDF) ( lBeersetal.lll990l) . Interesting results can be obtained on 
the mass, concentration and elongation of the clusters. Since we 
exploited only strong lensing data in the inner regions, we can- 
not determine the virial halo mass accurately. Our typical error on 
A/200 is then > 60 per cent. This reflects the large uncertainty in 
the projected scale radius, discussed in Sec. |5] A better determi- 
nation would require either weak lensing data at larger radii or a 
much larger number of multiple image systems. The statistical un- 
certainty in the concentration measurement is large too (< 50 per 
cent), but better than the mass accuracy. 

Dealing with a triaxial analysis, we have five more parame- 
ters (three orientation angles and two axial ratios) with respect to a 
spherical analysis. The errors on M200 and C20 are consequently 
larger than when assuming spherical symmetry jSereno & Umetsu 
I2OI ih . Elongation can be somewhat constrained since it is directly 
coimected to the lensing strength. On the other hand, the PDFs of 
the intrinsic axial ratio gi and 52 and of the angle are dominated 
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Figure 3. Estimated mass and concentration of the 2 > 0.5 MACS clus- 
ters versus theoretical predictions. The full and dashed thin lines denote 
the central value and the Icr p redictions, respectively, for the full sample of 
clusters in Duffy et al The redshift of the simulated cluster popula- 

tion is fixed at the average reds hift of the observe d sample, z ~ 0.56; the 
thick line is the prediction from lPrada et alj j201lh . 
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Figure 4. Effective Einstein radii versus concentrations for the 2 > 0.5 
MACS clusters. The full line is the theoretical prediction for unrelaxed clus- 
ters at 2 ~ 0.56. Concentrations for relaxed clusters are expected to be 
signiticantly higher. 

by the a priori assumptions. Our strong lensing data are not con- 
straining in this regard and we do not discuss them. The posterior 
distribution of tp just reflects the likelihood on 9^ and is not signif- 
icant in this context. 

The final balance is that, with respect to a standard spherical 
analysis, we can constrain a third parameter, i.e., the elongation, at 
the expense of a poorer accuracy on halo mass and concentration. 
Our Bayesian analysis also cuts unphysical models, such as those 
corresponding to extremely large values of Vsp which are still rep- 
resented by the likelihood. 



7 RESULTS 

7.1 Mass and concentration 

The estimated mass-concentration relation for the high redshift 
MACS sample is plotted in Fig.|3] Our strong lensing analysis could 
naturally not determine the total cluster masses very accurately. 
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whereas the constraints on the concentrations were slightly tighter. 
However, the over-concentration problem can be addressed even 
if uncertainties are large. We found out that the z > 0.5 MACS 
clusters are very massive as expected (A/200 IO^'^A/q/Zi), with 
quite low concentrations C200 ^ 3. Usually, strong lensing se- 
lected clusters with mass of this order were found to be over- 
concentrated to the extent of C200 ~ 10. Even with a quite large 
error of 5c200 ^ 1.2 we can then distinguish clusters in agree- 
ment with the theoretical c(A/) relation from outliers. Furthermore, 
most of the theoretical studies on the c(Af) relation pre dict a quite 
shallow dependence of the concentration on the mass 1 Neto et al] 
I2OO7I : iMaccio etalj|2008l ; lOao et al.ll20o3 : iDuffv et al ll2008l) . so 
that comparison with our results can be meaningful even with large 
uncertainties 5A/200 . However, large errors prevent us from deter- 
mining the slope in the observed c(M) relation. 

We find a noteworthy agreement between observations and 
predictions for the MACS sample. All of the measured concentra- 
tions are compatible within Icr with the predicted values. A small 
decrement of concentrations with increasing masses seems to be in 
order, but larg e errors asks for c aution. 

Recentlv. lPrada et"al] j201 ll) noticed a flattening and upturn of 
the relation with increasing mass and redshift. Predicted concen- 
trations for the MACS sample would be significantly higher, see 
Fig. [3] In that case, our results would pose a under-concentration 
problem. 

The concentration of relaxed, l ower-z clusters, is expected 
to correlate with the Einstein radius jSadeh & Rephaeli| [2008). In 
fact, more concentrated clusters should have more mass in the 
middle, thus boosting their Einstein radius. We did not recover 
this trend in our higher redshift sample, see Fig. |4] The overplot- 
ted theoret ical c(0e) r e lation has been obtained from the c{M) 
derived in iDuffv et alj ( l2008h by exploiting the expected trend 
between mass es and Eins tein radii found for unrelaxed clusters 
(Se oc Afom. ISroadhurst & B arkana 2008). Strong lensing anal- 
yses can measure very accurately the Einstein radius, whereas the 
determination of the concentration is less precise. Large errors in 
C200 might then hide the real trend in the c(Se) relation for the 
z > 0.5 MACS sample but apparently here the Einstein radius 
is not larger for smaller concentrations, as expected for unrelaxed 
high redshift massive halos. 
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Figure 5. Theoretical probability density functions for the orientation 
(cos 9) of randomly oriented (full line) or biased (a^^^ q = 0.115, dashed 
line) A'^-body like clusters versus measured values for the 2 > 0.5 MACS 
sample (normalized histogram). The predicted distributions were convolved 
with a Gaussian distribution with dispersion of ~ 0.23. 




O" cos 6 

Figure 6. Kolmogorov-Smimov significance level that the orientations of 
the z > 0.5 MACS sample are drawn from a biased population of clusters 
with dispersion a^^^ g . The predicted distributions were convolved with a 
Gaussian function with dispersion of ~ 0.23. 



7.2 Orientation 

Very low values of the orientation angles 9 for the MACS sam- 
ple would suggest an orientation bias. Measured values are listed 
in Table |4] Predicted versus observed values are plotted in Fig. [5] 
To account for the observational uncertainty, theoretical predictions 
were convolved with a Gaussian function with dispersion equal to 
the average uncertainty on cos 9 (~ 0.23). With respect to a ran- 
domly oriented sample, there is an excess at low orientation angles, 
COS.9 > 0.5. The more inclined clusters (cos9 > 0.8) are those 
which appear nearly circular in the plane of the sky (e < 0.1). 

The Kolmogorov-Smirnov (KS) test can provide further in- 
sight. Neither of the two extreme cases considered (strong lensing 
orientation bias with CTcosS = 0.115 or random orientations) pro- 
vide a very good description of the data but the strongly biased 
case is slightly preferred. The KS significance level is of 0.9 per 
cent for the biased case or 0.4 per cent for the random case. To as- 
sess the size of any orientation bias, we run the KS test for different 
values of dcose, see Fig. [6] The significance level is maximum for 
o"cos e =0.4, suggesting that a mild form of orientation bias affects 
the X-ray selected MACS sample. 



7.3 Elongation and ellipticity 

A combined analysis of elongations and ellipticities can provide 
further information on the orientation bias. Estimated elongations 
are listed in Table|5] The observed versus the predicted values of the 
elongations are plotted in Fig. |7] The cumulative distributions are 
plotted in Fig. [8] As theoretical distributions we considered a pop- 
ulation of clusters with TV-body like axial ratios. The masses and 
the redshifts of the theoretical distribution are fixed to the average 
values of the z > 0.5 MACS sample. As before, we considered two 
cases for the orientation: either randomly oriented lenses or clusters 
strongly inclined toward the line of sight ((Jcose = 0.115). Theo- 
retical distributions were smoothed to account for observational un- 
certainties by convolving with a Gaussian function with dispersion 
equal to the average observational error on the measured elonga- 
tions, i.e. ~ 0.5. The biased population makes a better job in re- 
producing observed measured elongations, with a KS significance 
level of 0.3 per cent compared to 0.1 per cent for the randomly 

orie nted clusters. 

iMeneghetti et al.l ( I2OIII) found that three clusters in the sam- 
ple, namely MACSJ07 17, MACSJ0025, and MACSJ2129, haveex- 
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Table 5. Inferred elongations of the z > 0.5 MACS sample. The 
lower CA, the more prominent the elongation along the line of sight. Cen- 
tral values and dispersions are computed as bi-weight estimators of the 
marginalised PDFs. 
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1.6 ±0.6 
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1.3 ±0.6 


J0257 


1.2 ±0.5 


J0451 


1.3 ±0.5 


J0647 


1.0 ±0.3 


J0717 


0.9 ±0.4 
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J0911 
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Figure 7. Theoretical probability density functions for the elongations of 
either randomly oriented (full line) or biased (CTcos e = 0.115, dashed line) 
JV-body like clusters versus measured values for the z > 0.5 MACS sample 
(normaUzed histogram). 




Figure 8. Predicted cuinulative distribution functions for elongation ver- 
sus measurements of the z > 0.5 MACS sainple. The step-line is for the 
observed sample. The smooth full and dashed lines are the predicted dis- 
tributions for A'^-body like clusters with either random or biased (ctcos e = 
0.115) orientations, respectively. 
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Figure 9. The same as Fig.|2]for the projected ellipticity. 




Figure 10. The same as Fig.[8]for the projected ellipticity. 

tremely large lensing cross sections. Triaxiality might help to un- 
derstand these outliers. In fact, two out the the three clusters, i.e. 
MACSJ0717 and MACSJ2129, have two of the smaller ca values 
(~ 0.9) in the sample. Convergence is proportional to I/ea, and 
elongation can enhance the cross section. MACSJ0717 is known 
to sit at the tip of a prominent filament, which further supports the 
view of an elongated structure. 

The same analysis can be applied to the measured ellipticities, 
see Table [3] Note that e is directly inferred from the strong lens- 
ing analysis whereas ba was estimated through the deprojection 
method. The observed versus the predicted values of the elliptici- 
ties are plotted in Fig.|9] The cumulative distributions are plotted in 
Fig. [To] We considered the same theoretical distributions as for the 
elongations. This time, the convolving normal function accounting 
for observational uncertainties had a dispersion of ~ 0.07. You can 
see by eye that the biased population can reproduce the observed 
values much better than the random one, in particular it predicts 
the observed excess at small values. In more quantitative terms, the 
KS significance level for the biased population is a robust 10 per 
cent, whereas for the random sample we got a negligible 2 x 10~* 
per cent. 

A further element in favour of an orientation bias is given by 
the analysis of the two-dimensional distributions, see Fig.[TT| Ran- 
domly oriented or biased populations in our analysis share the same 
axial ratio distribution. On the other hand, the orientation bias is 
compatible with a significant number of inclined clusters which are 
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Figure 11. Ellipticities and elongations of the MACS sample (crosses) 
versus the theoretical predictions. Contours are plotted at fraction val- 
ues exp(— 2.3/2), exp(— 6.17/2), and exp(— 11.8/2) of the maximum, 
which denote confidence limit region of 1 , 2 and 3(T in a maximum likeli- 
hood framework, respectively. The shadowed regions are for a population 
of A'^-body like haloes with a biased orientation (a^as e = 0.115); the thick 
contours are for randomly oriented clusters. 
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Figure 12. Estimated mass and concentration of the z > 0.5 MACS clus- 
ters versus theoretical predictions assuming priors for mass and concentra- 
tion flat in logarithmic bins. Symbols and styles are the same of Fig. [5] 

at the same time nearly circular when projected in the plane of the 
sky (small values of e) and quite elongated along the line of sight 
(small values of ba)- This feature is seen in the distribution of ob- 
served values. All pairs of measured e and ca are compatible within 
the 2a confidence level with the biased population. 



8 SYSTEMATICS 

Let us discuss some systematic uncertainties that might affect 
our analysis. The well-established modelling method used in 



Izitrin et al] ( l2011ah to identify many sets of multiply-lensed im- 
ages has been very successful in determining source redshifts. Fur- 
thermore, within the PixeLens approach, statistical uncertainties in 
the modelling due to unknown source redshifts (as far as the er- 
ror on Zs is < 0.5) are lower than degeneracies in the parame- 
ter sp ace due to variations in the mass density ( Saha & Williamsl 
120040 . Effects of redshift uncertainties are then expected to be neg- 
ligible. Firstly, over-(under) estimating the source redshift would 
decrease the lensing strength of the lens, and in turn the estima- 
tions of mass and concentration. This trend would not solve any 
over-concentration problem in ACDM, which would be artificially 
solved only for mass and concentration systematically changed in 
opposite directions. Secondly, the main effect of redshift uncertain- 
ties would be on the determination of the profile slope. Here, we 
were interested in comparison with numerical simulations which 
assumed a NFW profile for the matter halo. Then, we could keep 
the profile slope fixed and we did not determine it from the data. 
Thirdly, any variation of Szs ~ 0.5 for a typical lensed source at 
Zs ~ 2 would change the image position by 6-10 per cent. At a typ- 
ical Einstein radius of 30", this would translate into an astrometric 
error of < 2-3". Since we used a pixelated map to reproduce the 
mass density, the effect on parameter degeneracy is then negligible. 
In fact, our typical pixel resolution for the z > 0.5 MACS clusters 
is of ~ 3 — 4", see Table|2l which is worse than the redshift uncer- 
tainty. 

Substructure and line-of-sigh t haloes might also play a 
role to'Aloisio & Nataraian|[2011ah . A group-sized halo residing 
slightly behind the lens typically brings about an astrometric shift 
of ~ 0.25", with largest deviations of ~ 1". If the masses of the 
two line-of-sight structures are roughly equivalent the deviations 
increase to ~ 0.9" on average and can be as high as a few arc- 
seconds. The effect is more dramatic for uncorrelated line-of-sight 
structures which should perturb the image locations by < 1" for a 
lens at Zs ~ 0.5. These effects turns then out to be negligible when 
compared with our typical pixel resolution. 

A different source of error might come from the use of priors 
in the Bayesian analysis. If the likelihood is not peaked, final re- 
sult might reflect the employed a priori hypotheses. Whenever the 
parameter values are limited from above and below, as it is con- 
ceivably for the mass of MACS clusters, which are known to be 
very massive, and the concentration, which by definition is positive 
and very hardly exceeds very large values, flat priors in the allowed 
range, as the ones we used in Sec. (6) are the best choice. However, 
an alternative might be to use priors uniform in logarithmically- 
spaced decades, as usually done for parameters with only lower 
bounds. We then re-run the analysis with these alternative priors 
for mass and concentration. Results are summarised in Fig. [12] The 
agreement with the results obtained in Sec.[6]shows that our results 
were determined by the data and that the role of priors was sec- 
ondary. The only effect of logarithmically-spaced uniform priors 
was to favor slightly smaller masses. The shift was however much 
smaller than the estimated uncertainty, so that the determined trend 
for the c(M) relation is unaffected. 



9 CONCLUSIONS 

The full understanding of the interplay between mass and concen- 
tration in galaxy clusters is an open problem. The rich strong lens- 
ing features of the 2 > 0.5 MACS clusters offer the opportunity to 
study the c(M) relation in a well defined statistical sample. We 
exploited a full triaxial lensing analysis, where the matter halos 
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were modelled as triaxial NFW ellipsoids with arbitrary orienta- 
tions. The method is not affected by systematics related to shape, so 
that the resulting estimated concentrations should not suffer from 
any orientation bias. 

Our results are apt to comparison with A'^-body simulations. 
We find that the c{M) relation of the z > 0.5 MACS sample is in 
remarkable agreement with theoretical predictions. There is no hint 
to an over-concentration problem. The clusters turn out to be pref- 
erentially elongated along the line of sight, although this orientation 
bias is much smaller than for strong-lensing selected clusters. 

The obtained masses and concentrations are also in line with 
some general expectations based on the properties of the sample. 
The MACS clusters are X-ray selected. Gas distribution is usually 
rounder than the mat ter density, so triaxiality plays a smaller role 
in cluster selections dGavazz 1 120051) . The orientation bias as well 
as the over-concentration problem should be then sensibly reduced 
with respect to strong-lensing selected samples. On the other hand, 
the z > 0.5 MACS clusters are representative of the very high mass 
tail of the full cluster distribution, so that the role of a smaller bias 
might be amplified. We indeed retrieve such expected trends. 

With more and more reliable observational estimates of the 
c(M) relation, the onus seems now to be on the theoretical pre- 
dictions from A'^-body analyses. Dark matter A''-body simulations 
usually samples very few massive clusters and their predictions at 
the high-redshift and massive end are mainly based on extrapola- 
tions of results of low z haloes. Results from independent analyses 
are still in disagreement and estimate d concentrations f or massive 
clusters can differ by < 50 per cent dPrada et^l201 ih . Baryonic 
physics can play an important role too, especially in the very in- 
ner regions probed by strong lensing. The baryonic contribution 
to the assembling of halos in large simulation volumes has yet 
to be fully understood. The explicit effects of cooling, feedback, 
and baryonic-DM interplay on th e mass profile and concentra- 
tion are also still ambiguous (e.g., iGnedin et alJllOoi : IRozo et al.l 
l2008l : lDuffvetal.ll2010h . Continuous advancements in computa- 
tional power may soon enable sufficiently high-resolution descrip- 
tion to enlighten the true (theoretical) c(A/) relation for the most 
massive halos. 

Although we have found no discrepancy between theory and 
observations in the c(M) relation of the high-jz MACS sample, we 
note that this sample appears to consist of mostly unrelaxed clus- 
ters. The high X-ray luminosity, along with the relatively high red- 
shift and recent SZ effect images (via private communication), are 
in support of this claim, in addi tion to noticeable s ubstructure seen 
in optical and lensing analyses jZitrin et al.l201 1 j) . The inner mass 
distribution of (most of) these clusters seem correspondingly rather 
widely-distributed (see also Table [TJ, which may account for their 
extensive lensing properties and large Einstein radii. In this regard, 
the combined analysis of lensing data together with X-ray mea- 
surements and observations of the SZ effect seems to be a very 
promising approach to constrain the intrinsic stru cture of the mass 
and gas distributions even in unrelaxed clusters jPe Filippis et al.l 
|2005| - ISereno et al.ll2006l. I2OIII : iMorandi et alj|201l[) . 

Unlike relaxed and highly-concentrated clusters which cor- 
respondingly have more mass in the middle boosting their criti- 
cal area and Einstein radius, in higher-z, unrelaxed clusters, large 
Einstein radii may however form due to a widely-distributed in- 
ner mass distribution, so that the critical curves of the smaller sub- 
structures near the core m erge together to form a much larger Ein- 
stein r adius curve jTorri et al. 2004; Fedeli et al. 2006; Zitrin et al. 
l2009ah . If this is indeed the case, low concentrations and relatively 
shallow inner profiles should be expected for clusters as those in 



the z > 0.5 MACS sample jNeto et al.l2007h . Although the results 
of our analysis may not be projected on relaxed lower-z samples 
or unambiguously resolve the claimed discrepancy in the c(Af ) 
relation for relaxed halos, they are of an additional high impor- 
tance, since such information can help constrain and shed new light 
on the relaxation era of (massive) clusters, and more generally on 
the evolution of the large-scale structure. Our fi nding of low con- 
centrations sugges ts, as previously implied (e.g.. lNeto et alj|2007l ; 
IZitrin et al ]|201Ibh . that high-2 unrelaxed clusters therefore form a 
different class of prominent and impressive gravitational lenses. 

Our analysis would benefit from either arc redshift confirma- 
tion or data for different scales, as those provided by weak lensing 
observatio ns. Upcoming o bservations planned in the CLASH pro- 
gram (Post man et al.l201ll) are going to supply this information and 
will further establish the results of our work. However, combined 
strong plus weak lensing analyses may be dominated by the strong 
lensing part due to the lower systematics. Therefore, considering as 
a first step only strong lensing can be reasonable. 
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